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A prototypical problem on which techniques for exact enumeration are tested and compared is the 
enumeration of self-avoiding walks. Here, we show an advance in the methodology of enumeration, 
making the process thousands or millions of times faster. This allowed us to enumerate self-avoiding 
walks on the simple cubic lattice up to a length of 36 steps. 

I. INTRODUCTION 

5^ , According to renormalization group theory, the scaling properties of critical systems are insensitive to microscopic 
Oh 1 details and are governed by a small set of universal exponents. Polymers can be considered as critical systems in the 
limit where their length N (the number of chained monomers) grows For instance, the free energy of an isolated 
^» ■ polymer in a swollen phase behaves asymptotically as exp(— F/v) = -Zjv ~ Afi N N e . Here, the connectivity constant 
\x and the amplitude A are non-universal (model-dependent) quantities. The exponent 9, however, characterizing the 
leading correction to the scaling behavior, is believed to be universal; it is related to the entropic exponent j s = 9 +1. 
The average squared distance between the end points of such polymers scales as N 2u , where v w 0.588 in three 
dimensions is also a universal critical exponent. 

Universal exponents such as 9 and v can be measured most accurately in computer simulations of the most rudi- 
\ mentary models in the universality class of swollen polymers, which arguably is that of self-avoiding walks (SAWs) 
on a lattice. Estimates of these exponents can be obtained by counting the number Zj^ of SAWs of all lengths up 
to Af max , and calculating the sum P/v of their squared end-to-end extensions, which scales as P/v ~ ZnN 2v . The 
exponents can then be obtained from 
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f~>) \ respectively, in the limit of increasing N. In Eq. (p}, the values of N are taken a distance two apart, so that 
the formula involves either only even N, or only odd N; this is more accurate than mixing even and odd values. 
Similar considerations lead to Eq. The accuracy of the estimates improves significantly with increasing A^ max , but 
unfortunately at the expense of an exponentially growing number of walks. In two dimensions, various algorithmic 
improvements have allowed for the enumeration of all SAWs up to N max = 71 steps Q, but these methods cannot be 
used effectively in three dimensions, which is the most relevant dimensionality for practical purposes. Hence, to date, 
the enumeration of three-dimensional SAWs stops at Af max — 30 steps [3fl. 

Counting SAWs has a long history, see e.g. Q. In a paper by Orr [5| from 1947, Zn was given for all N up to 
]V max = 6; these values were calculated by hand. In 1959, Fisher and Sykes @ enumerated all SAWs in 3D up to 
]V max = 9 using a computer. More recently, in 1987 Guttmann [3 enumerated longer SAWs up to N max = 20, and 
extended this by one step in 1989 0. In 1992, MacDonald et al. reached N max = 23, and in 2000 MacDonald et 
al. [l0| reached Af max = 26. 

Here, we present the new length-doubling method which allowed us to reach N max = 36 using 50,000 hours of 
computing time, a result that would have taken roughly fifty million hours with traditional methods, or alternatively 
we would have to wait another 20 years by Moore's law (which states that the number of transistors on a computer 
chip doubles every two years) before we could undertake the computation. 



II. LENGTH-DOUBLING METHOD 



In the length-doubling method, we determine for each non-empty subset 5* of lattice sites the number Zn(S) of 
SAWs with length and originating in the origin, that visit the complete subset. Let \S\ denote the number of sites 
in S. The number Z^n of SAWs of length 2Af can then be obtained by the length-doubling formula 
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This equation can be understood as follows. Let N > 1 be fixed. Let Ai be the set of pairs («, w) of SAWs of 
length N that both pass through lattice point i. Here, a walk v starts in 0, and then passes through V\, . . . , vat. Since 
the distance reached from the origin is at most N, there exist only finitely many non-empty sets Ai. Then, the total 
number of SAWs of length 2N equals 



Z2N — Z N — 
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because every pair (v,w) of the possible pairs can be used to construct a SAW of length 2N, except if v and w 
intersect in a lattice point i. The resulting walk 



(v, w) = (Vpf-l -v N ,...,vi- v N , -v N , Wi - v N , . . . , w N - V N ) 



(5) 



of length 2N is obtained by connecting the two walks in and translating the result over a distance — u/v- The new 
starting point is then the translated old end point of v and the new end point is the translated old end point of w. 
Note that from a SAW of length 2N we can also create a non- intersecting pair (v, w) by using ((3J), so that indeed we 
have a bijection between such pairs and SAWs of length 2N. 

The inclusion-exclusion principle from combinatorics, see for instance (llT Chapter 10], states that 
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for the union of n sets Ai. We can apply this principle, noting that for a non-empty set S = {i±, . . . ,i r } the 
intersection A^ n - * - PI A ir . has Zff(S) elements, where Zm{S) is defined as the number of SAWs of length N that pass 
through all the sites of S. The sign of the term corresponding to the set S in the expansion ^ is (— l) r+1 , where 
r = \S\. Substituting this in Eq. and combining with Eq. Q yields the length-doubling formula Eq. ([3]). The 
length-doubling method is illustrated by Fig. [T] 



III. APPLICATION OF THE LENGTH-DOUBLING FORMULA 



The usefulness of this formula lies in the fact that the numbers Zn(S) can be obtained relatively efficiently: 

• Each SAW of length N is generated. 

• For each SAW, each of the 2^ subsets S of lattice sites is generated, and the counter for each specific subset 
is incremented. Multiple counters for the same subset S must be avoided; this can be achieved by sorting the 
sites within each subset in an unambiguous way. 

• As the last step, the squares of these counters are summed, with a positive and negative sign for subsets with 
an even and odd number of sites, respectively, as in Eq. ([3]). 

With Zjss walks of length N, each visiting 2 N subsets of sites, the computational complexity is 0(2 N Zn) ~ (2/J-) N 
times some polynomial in N which depends on implementation details. This compares favorably to generating all 
Z2N ~ A* 2Ar walks of length 2N, as fi w 4.684 on the simple cubic lattice. 

A practical problem which is encountered already at relatively low N, is the memory requirement for storing the 
counters for all subsets. An efficient data structure to store these is based on a tree structure. The occurrence of a 
subset {a, b, c, d, e}, in which a, 6, c, d, and e are site numbers ordered such that a<b<c<d<e, is stored in the 
path a— > b — > c — > d —> e, where a is directly connected to the root of the tree and e is a leaf. 

We added two further refinements to the method sketched above. First, we exploit symmetry. Two subsets Si 
and 5*2 which are related by symmetry will end up with the same counter. One can therefore safely keep track of 
the counter belonging to only one subset S out of each group of symmetry-related subsets. This reduces the memory 
requirement by a factor close to 48 (slightly less because of subsets with an inherent symmetry); in practice, the 
computational effort goes down by a similar factor. 



FIG. 1: Illustration of the length-doubling algorithm, using a small subset of three walks of length N = 18. Ignoring 
intersections, there are Z = 3 candidates for SAWs of length 36: the blue-red, blue-orange, and red-orange combinations. 
Ignoring double counting, Z should be reduced by 3 because of the intersections a = (2, 3, 1), b = (2, 0, 0), and c = (0, —2, 0). 
Correcting for double counting because of the pair of sites S = {a, b}, the number of self-avoiding combinations is thus 
3 — 3 + 1 = 1. Indeed, only the red-blue combination is self-avoiding. Using a computer, we applied this approach to 
combinations of all walks of length N = 18. 

The second refinement is tree splitting. Rather than computing the full tree, we split the tree into non-overlapping 
subtrees, using for instance as a criterion the value of the site with the highest number. Another criterion is the 
subset size This splits up the summation in Eq. (J3|) into independent sums, which can be computed in parallel. 

With the length-doubling method, it is also possible to compute the squared end-to-end distance, summed over all 
SAW configurations. The squared end-to-end distance for walks of length N is denned by 




(7) 



where the sum is taken over all the SAWs of length N, and ||tfjv|| is the Euclidean distance of the end point w?{ of 
walk w from the origin. 
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The length-doubling formula for the squared end-to-end distance then becomes 

P 2N = 2Z N P N + 2]T(-1)I S I (Z N (S)P N (S) - \\E N (S)\\ 2 ) . (8) 

Here, Pn{S) is the total squared end-to-end distance for all walks of length N that pass through the complete set S, 
and the extension En(S) is defined as the sum of wn for all such walks w. This formula can be understood again 
by using the inclusion-exclusion principle, but now generalised to add (squared) distances for sets Ai instead of just 
counting numbers of elements. The first term of the right-hand side of Eq. ([8]) is obtained by computing 

^2 \\ w n - v N \\ 2 = ^ (| |wjy|| 2 + ||^Af|| 2 - 2v N ■ w N ) 

(v.w) (v,w) 

W V V w 

= 2Z N P N , (9) 

where the inner product vanishes because of the symmetry between v and —v. For walks passing through S a similar 
derivation holds, but now the inner product does not vanish, and instead gives rise to the term ||_Etv(S')|| 2 . Computing 
P2N by this formula requires additional counters for each subset S, namely for the total extension in the x-, y- and 
^-directions, as well as for the total squared extension Pn(S). 



IV. RESULTS 



With length-doubling, we obtained Z N up to Z 36 = 2 941 370 856 334 701 726 560 670, with a squared end-to-end 
extension of P 36 = 230 547 785 968 352 575 619 933 376. All values of Z N and P N for N < 36 are given in Table Q] 
In a first analysis of the results, we fitted the data for N = 18, . . . , 36 with the functions 

Z N « A^N^ (l + ciiV- Al +h(-l) N N- ai ) , 

Pn « A 2 ^N 62 (l + c 2 N- A * + k 2 1 + ( "~^ N N- a ^J , (10) 

by minimizing e\ = Y^i=is {^°& z i ~ 1°§ z i) anc ^ e 2 = Sf=i8 (^°S ^ — ^) with respect to the fitting parameters. 
The different treatment of the oscillatory corrections, (— l) w vs. (l + (— l) N )/2, gave us a slightly better fit, but we 
do not attribute much physical significance to this. The resulting fit values are presented in Table [TTJ Note that 
the resulting values for e\ and e 2 are small, indicating that the deviations between the exact number log Zi and the 
approximation log Zn only show up in the ninth or tenth significant digit. 

From a physics perspective, the most interesting observables are the growth exponent and the entropic exponent 
7 a . The fitting procedure yields v = (9 2 — 0\)/2 — 0.593, which is to be compared with the Monte Carlo result 
v = 0.587597(7) as obtained by Clisby 12]. The fitting procedure also yields j s = 0\ + 1 = 1.1597, which is not 
too far from the literature value 7^ = 1.1573(2) as obtained by Hsu et al. [IH using the pruned-enriched Rosenbluth 
method. Deviations between our results and those in the current literature are likely related to the fact that we 
observe finite-size corrections with correction exponents Ai ~ 1.4 and A2 ~ 0.26, while the literature values for v and 
7 S are obtained under the assumption of a shared correction exponent of A w 0.46. A more refined analysis of our 
enumeration data, along the lines of Ref. Q, is required to investigate the nature of the finite-size corrections. Such 
an analysis will probably also yield slightly different values for v and "f s . 

In the near future, we will apply our new approach for exact enumeration to other lattices such as face-centered- 
cubic and body-centered-cubic, and generalize it to various other models in polymer physics, such as confined and 
branched polymers, and to various other models in statistical physics. 
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TABLE I: Enumeration results on the number of three-dimensional self-avoiding walks Zn and the sum of their squared 
end-to-end distances Pjv- 
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TABLE II: Fitted values for the parameters in Eq. ([i"0|) . 
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